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The ab-initio crystal structure solution by 
powder diffraction data requires great 
efforts because of the collapse of the 
experimental information onto the one 
dimensional 29 axis of the pattern. 
Different strategies will be described aim- 
ing at improving the process of extraction 
of the integrated intensities from the 
experimental pattern in order to make 
more straightforward the structure solution 
process by direct methods. Particular 
attention will be devoted to the EXPO pro- 
gram. Some of its performance will be 
analysed and results will be shown. 
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1. Introduction 

Two approaches can be followed for solving ab-ini- 
tio crystal structures by powder diffraction data: the 
traditional approach and the direct space approach. 
Both of them require the knowledge of the following 
minimal information: a) experimental diffraction pro- 
file; b) cell parameters; c) space group; d) unit cell con- 
tent. In addition, the direct space methods (simulated 
annealing, genetic algorithm, Monte Carlo techniques) 
need the knowledge of the structure molecular geome- 
try also. When this information is not available, the tra- 
ditional method is the obligatory choice. 

The main steps of the traditional approach consist of: 
1) indexing the powder pattern; 2) determining the 
space group; 3) solving the structure by direct methods, 
Patterson methods or Maximum Entropy; 4) refining 
the structure by the Rietveld method. 

About point 3), direct methods are based on statisti- 
cal and probabilistic calculations. They use the experi- 
mental \Fi\ structure factor modulus corresponding to 



each h reflection (see Ref [2] for direct methods theo- 
ry) and aim at solving the phase problem. The inverse 
Fourier transform of the F/, structure factors provides 
the electron density map whose maxima correspond to 
the atomic positions. Direct methods are successfully 
applied to single crystal data. In case of powder solu- 
tion, the extraction of the integrated intensity 4 (^ oc 
\Ff^f) for each reflection from the experimental pattern 
is preliminary to the application of direct methods. It 
requires the decomposition of the experimental profile 
into the single peaks and the area under each peak gives 
the wanted 4 value. Unfortunately, some problems 
occur and make the extraction procedure very critical 
for the success of the structure solution step. The main 
problems regard: 

a) the peak overlap. It depends on the experimental 
resolution and on the structure complexity It 
increases at large 20 angular values of the observed 
pattern; 

b) the background. Its estimate is not trivial. It adds to 
the peak overlap effect so that, especially at large 20 
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values, it is difficult to estimate the noise contribu- 
tion correctly; 
c) the preferred orientation. The crystallities are not 
always randomly oriented. This behaviour modifies 
the ratios of the experimental intensities. 
For the above mentioned problems the l^^^l estimate 
process reveals itself as a crucial point in the powder 
ab-initio solution: the more reliable the extracted inte- 
grated intensities values, the larger the success proba- 
bility of solving the structure. 



^ ^3F\^ 



where the summation is over the number of reflections, 
l^ftlextmcted i^ the structure factor modulus extracted by 
one of the two methods and l^/iLe is the structure fac- 
tor modulus calculated by using the published atomic 
parameters. The reliability parameter R^ about the pro- 
file: 



2, The Integrated Intensity Extraction 
Process 



^ ^ ^bobs(0|-|>^calc(OI 
^Ij^obsCOl 



Two methods are widely used for extracting the inte- 
grated intensities from the powder pattern: the Pawley 
method [3] and the Le Bail method [4]. 

The Pawley method is based on a non linear least 
squares procedure. The integrated intensities are refin- 
able variables in addition to the profile parameters. 
Because of the peak overlap, the least squares are often 
unstable and they provide negative integrated intensity 
values which must be discarded. For this reason the 
method needs positivity constraints [5], [6]. 

The Le Bail method is an iterative decomposition 
algorithm following the Rietveld formula [7]. The inte- 
grated intensity value is calculated according to: 



^=X(>'obs(0->'b(0)*>'calc(^"^^)/X-K.alc(^^)- 



where the summation is over the peak range, j^obsCO is 
the experimental count in the 2Qi angular value, y^{i) is 
the background contribution, y^^J^i, h) is the calculated 
count in IQ^, due to the h reflection contribution. The 
Le Bail method starts with arbitrary but fixed integrat- 
ed intensity values and the formula is cyclically 
applied. It is rapidly convergent; it provides positive 
values if the background is properly estimated, but it 
tends to equiportion the intensity of a group of reflec- 
tions strongly overlapping. 



is considered also. The summation is extended to the 
number of profile counts, j^obsO) ^^^ J^caicCO ^^^ the 
observed and the calculated counts, respectively. 

In Table 1 , crystal chemical information are given for 
some test structures (the code name, the space group, 
the unit cell content, the 2^ experimental range and the 
number of reflections in the range). They cover a quite 
large variety of cases. 

In Table 2, for each of some test structures, the R^ 
and the R^ values are shown. They are calculated by 
using the integrated intensities extracted by the 
EXTRA program [8] and the ALLHKL program [3], 
respectively. EXTRA is a Le Bail based package, 
ALLHKL uses the Pawley method. In Table 2 the R^ 
values are small but the R^ values are large (0.4 is the 
average value). This means that: a) low R^ value is nec- 
essary and not sufficient condition for a reliable extrac- 
tion; b) the integrated intensity accuracy is very low 
and this is the reason for which the powder ab-initio 
solution is not straightforward. Moreover, the R^ values 
by EXTRA are always smaller than the R^ values by 
ALLHKL so that we could conclude that the Le Bail 
method should be preferred but that behaviour may 
depend on the equipartition tendency of the Le Bail 
approach. However we proved that the statistical effi- 
ciency of direct methods improved by using the Le Bail 
extracted intensities [9]. 



3, The Direct Methods Efficiency With 
Powder Data 

In order to assess which of the two above mentioned 
methods is more suitable to be combined with direct 
methods, it proves useful to take into account the relia- 
bility parameter Rp about the extracted amplitudes: 



4, The Le Bail Method Advantage 

The Le Bail method offers a great advantage: it is 
very sensitive to the starting point. This aspect is shown 
in Table 3 where, for some test structures, the R^ values 
are shown. Protocol 1 corresponds to the traditional Le 
Bail and Pawley extraction cases respectively, while 
Protocol 2 corresponds to the case when the true inte- 
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Table 1. Code and crystal chemical information for the test structures. X: home diffractometer data; N: neutron data; S: synchrotron data 



Code 


space Group 


Unit cell content 


2?? Range 


Nr. reflection 


AGPZ (X) 


Pbca 


Ag8Ni6C24H24 


5.0-80.0 


258 


ANDl (S) 


Pl^ln 


^28^20^8^44 


4.0-50.0 


896 


BACO(N) 


Cllm 


Ba4C802oD8 


20.7-150.0 


272 


BAMO (X) 


PI, 


Ba4Moi204Q 


10.0-119.0 


1220 


BENZ (S) 


Ply! a 


C24F12 


5.0-100.0 


716 


CF3BR (N) 


Pl^la 


C4Br4Fi2 


6.0-150.0 


375 


CFCL (N) 


Fddl 


^8^" 16*^1 16 


5.0-150.0 


203 


CFI(N) 


Cmca 


^8^2418 


10.0-150.0 


428 


CIME (S) 


pyn 


S4C4oN24Hg4 


8.01-84.99 


924 


CROX (X) 


p 1 


Cr802i 


6.0-80.0 


657 


CUPZ (X) 


Pbca 


^^^le^i^iA 


5.0-80.0 


243 


DADA (X) 


Plylyly 


TigK4Sii204o 


10.0-95.0 


518 


EMT (S) 


P 63/m m c 


(Si,Al)96Na28O204 


4.5-63.0 


670 


GAPO (S) 


Pbca 


Ga32P320i28F8C56 


7.0-63.69 


1235 


LAMO (X) 


Pl^la 


La4M02o032 


11.0-69.0 


271 


LASI (N) 


Pl^lc 


LagSig028 


10.-115.72 


253 


LEV (S) 


Khm 


[Si540io8]3C8NHi6 


8.0-85.6 


323 


MCM (S) 


P 61m m m 


Si720i44 


2.2-50.0 


480 


MESPC) 


Plylc 


C24N4O20S4H52 


5.0-88.0 


719 


METYL (S) 


1222 


Nai6Ci6H48 


5.2-70.0 


318 


NBPO (S) 


Cllc 


Nb2oOi2(>P28 


3.0-60.0 


1201 


NIZR (S) 


Pl^ln 


Ni4Zr8P40i6 


8.0-52.0 


627 


PBS (S) 


Pbca 


Pb8Si6024 


7.5-79.8 


477 


SAPO (S) 


Pmmn 


Si32064N2C48 


5.0-79.98 


716 


SBPO (S) 


Pl^in 


Sb8 P14048 


6.0-100.0 


1071 


SOT (S) 


I Alia m d 


Si^O 128*^104 


8.5-92.96 


451 


UTMl (S) 


Cllm 


^144^88 


2.5^9.97 


1133 


VFI (S) 


P63 


^118^18^114 


5.0-90.0 


787 


VNI(S) 


P 422^2 


Rb44K4Si96Zn240288 


5.0-60.0 


1345 


YONO (S) 


PI, 


Y8026N2H18 


7.0-80.0 


680 


YURI (X) 


Plylc 


Na4S40i6Ci2F2 


8.0-63.96 


243 



grated intensities are used as starting values in the Le 
Bail and Pawley methods, respectively. The values in 
Table 3 suggest that the integrated intensity estimate is 
not sensitive to the starting point if the Pawley method 
is adopted. On the contrary, it is not so for the Le Bail 
approach. This means that if the starting integrated 
intensities are less arbitrary and closer to the true ones 
the amplitude estimate is improved. On the other hand, 
the values of Protocol 2 represent the maximal accura- 
cy level we can reach. The great advantage preserved 
by the Le Bail method can be exploited. From this last 
consideration the EXPO program was developed [1]. 



5. The EXPO Program 

EXPO is the integration of EXTRA and SIR- 
POW[10] programs. This last is devoted to the structure 
solution by direct methods. EXPO needs the minimal 
information about the experimental powder pattern, the 



cell parameters, the space group and the unit cell con- 
tent (Fig. 1 shows an example of the minimal EXPO 
input). Its main steps are: 

1 ) Extraction of the integrated intensities (EXTRAC- 
TION routine); 

2) Normalization of the extracted intensities (NOR- 
MALIZATION routine); 

The normalization rule restrains that: 

<|£,P>=1 

where E^ is the normalized structure factor. The 
large l^"! value reflections are statistically meaning- 
ful. The statistical analysis of the normalized struc- 
ture factors can reveal the presence of pseudo-trans- 
lational symmetry and/or preferred orientation. 

3) Calculation of the structure invariant relationships 
(triplets and quartets) (INVARIANT routine); 

The structure invariant statistical reliability is taken 
into account. 
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Table 2. The 7?p and the 7?p values corresponding to the intensities 
extracted by EXTRA and ALLHKL respectively 



Code 




EXTRA 


ALLHKL 




(Le 


; Bail based) 


(Pawley based) 




7?F 


^P 


7?F 


^P 


AGPZ 


0.53 


0.12 


0.60 


0.15 


BACO 


0.35 


0.05 


0.39 


0.05 


BENZ 


0.43 


0.15 


0.81 


0.22 


CF3BR 


0.33 


0.09 


0.41 


0.12 


CFCL 


0.22 


0.06 


0.34 


0.06 


CFI 


0.51 


0.03 


0.78 


0.05 


CROX 


0.39 


0.09 


0.60 


0.13 


CUPZ 


0.50 


0.06 


0.86 


0.11 


LAMO 


0.39 


0.21 


0.46 


0.23 


LASI 


0.40 


0.11 


0.47 


0.11 


LEV 


0.60 


0.05 






MES 


0.50 


0.07 


0.77 


0.06 


METYL 


0.33 


0.10 


0.53 


0.10 


NBPO 


0.43 


0.10 


0.52 


0.18 


NIZR 


0.41 


0.18 


0.61 


0.18 


PBS 


0.43 


0.10 


0.48 


0.09 


SAPO 


0.47 


0.06 


0.87 


0.12 


SBPO 


0.51 


0.08 


0.82 


0.10 


SULPH 


0.35 


0.03 


0.47 


0.08 


YONO 


0.33 


0.10 


0.44 


0.10 



Table 3. For each test structure: a) the 7?p value by EXTRA and 
ALLHKL in a traditional extraction run (Protocol 1); b) the 7?p 
value by EXTRA and ALLHKL by using the true (calculated by the 
published positions) \F\ values as starting point (Protocol 2) are 
shown 



Code 


EXTRA 


ALLHKL 




Protocol 1 


Protocol 2 


Protocol 1 


Protocol 2 


AGPZ 


0.53 


0.26 


0.62 


0.61 


BACO 


0.34 


0.21 


0.39 


0.39 


BENZ 


0.43 


0.22 


0.52 


0.80 


CF3BR 


0.33 


0.15 


0.59 


0.46 


CFCL 


0.22 


0.12 


0.34 


0.27 


CFI 


0.50 


0.32 


0.74 


0.84 


CROX 


0.39 


0.16 


0.55 


0.60 


CUPZ 


0.50 


0.24 


0.86 


0.74 


LAMO 


0.39 


0.22 


0.39 


0.45 


LASI 


0.39 


0.16 


0.45 


0.47 


LEV 


0.60 


0.23 






MES 


0.49 


0.26 


0.73 


0.77 


METYL 


0.32 


0.25 


0.37 


0.54 


NBPO 


0.44 


0.12 


0.52 


1.11 


NIZR 


0.43 


0.24 


0.61 


0.55 


PBS 


0.43 


0.26 


0.48 


0.43 


SAPO 


0.47 


0.19 


0.87 


0.86 


SBPO 


0.51 


0.15 


0.82 


0.79 


SULPH 


0.34 


0.20 


0.44 


0.56 


YONO 


0.31 


0.18 


0.32 


0.42 



%structure cimetidine 

%initiaHse 

%data 

range 8.01 84.99 0.01 

pattern cimetidine. pow 

content s 4 c 40 n 24 h 64 

wave 1.52904 

cell 10.6986 18.8181 6.8246 90.0 111.284 90.0 

space p 21 In 

synchrotron 

%continue 

Fig. 1. EXPO minimal input: an example. 



4) Phasing the reflections (PHASE routine); 
Random phases are given to few pivotal reflections 
and the phase information is expanded to the large 
l^"! value reflections by using the most reliable 
triplets. The phasing trial corresponding to the 
largest combined figure of merit (CFOM) is select- 
ed. 

5) Calculation of the Fourier map (FOURIER routine). 
The selected phases are used for calculating the 
Fourier map whose maxima are searched and chem- 
ically interpreted. The map is optimized by combin- 
ing successive structure factor calculations with 
preliminary least squares cycles. 

Therefore, EXPO is a program able to reach the struc- 
ture solution starting from minimal experimental infor- 
mation. Thanks to the Le Bail tendency to be very sen- 
sitive to the starting point, EXPO is more than the triv- 
ial combination of the two programs. It is able to 
exploit information becoming available during the 
structure solution process itself in the extraction routine 
to improve the structure factor modulus estimate. 



6. The Use of Prior Information 

The following types of information provided by the 

solution process can be used as prior information for 

improving the extraction of the integrated intensities: 

a) Pseudo-translational symmetry information [11]. 

When a structure is affected by pseudo translational 

symmetry, a percentage of its electron density 

repeats itself after an u vector shift. This means that 

p^ir) = pir+u) 

where p^ is a /? percentage of the electron density 
and u is the pseudo-symmetry vector. In EXPO, the 
statistical l^"! value analysis is able to reveal the pres- 
ence of pseudo-symmetry, to recognize the percent- 
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age (the FSP fractional scattering power) and the 
type (the u vector). If this pseudo-symmetry occurs, 
an af, coefficient is associated to each h reflection so 
that, if af, is equal to zero, the reflection is said to be 
a superstructure reflection, on the contrary it is a 
substructure reflection. In the pseudo-symmetry 
case, the normalization rule is violated and 

<\E,\'>=\+ia,-\yFSP, 

This statistical information can be exploited in a 
successive intensity-recycled Le Bail extraction. In 
this case, the starting casual integrated intensities are 
modulated by the statistical term in the previous for- 
mula [1 + («/,- lyFSP], So doing, the substructure 
reflection intensities are increased and the super- 
structure reflection intensities are decreased. The 
new intensity estimates are more accurate than the 
traditional Le Bail extraction ones and the phasing 
process gives better results. 

b) Probabilistic estimate information [12]. 

In the INVARIANT routine, EXPO is able to pro- 
vide the probabilistic estimate of the structure factor 
modulus (the positivity condition of the electron 
density is considered in the reciprocal space) by 
using triplet relationships both in the centric case 
and in the acentric case. A intensity-recycled Le Bail 
extraction can be carried out by exploiting the 
amplitude statistical estimates as starting values. 

c) The Patterson information [13]. 

EXPO is able to calculate a Patterson map by using 
the extracted integrated intensities from a traditional 
Le Bail extraction. The map is modified (the origin 
peak is reduced and the low intensity points are put 
to zero). After that, the map is inverted. The thus 
obtained squared structure factor moduli can be 
exploited as starting point in a new Le Bail extrac- 
tion. 

d) The located fragment information [14]. 

If a traditional Le Bail extraction EXPO run is able 
to locate a fragment in correct way the structure fac- 
tor moduli calculated by taking into account the 
recognised atomic positions can be used as starting 
point in a intensity-recycled Le Bail process. 
We can summarise that the Le Bail potential to be 
sensitive to the starting point can be exploited by con- 
sidering different kinds of prior information to make 
the starting point closer to the true one. In this way, the 
extraction is more efficient and the structure solution 
results become more reliable. In Table 4, the results 
concerning the use of prior information are shown. The 
7?F value corresponding to the traditional Le Bail 



Table 4. Rp reliability parameters (xlOO). T?^ is in traditional Le Bail 
extraction case; ^pseud is in the prior pseudo-symmetry information 
case; ^patt is in the prior Patterson information case; ^prqb is in the 
prior probabilistic estimate information case, ^true when the true 
structure factor moduli are used 



Code 



^r 



^P 



^P 



^P 



7?x 



AGPZ 


52.28 


33.51 


38.93 


47.05 


24.18 


BACO 


31.32 




28.09 


28.22 


16.94 


BENZ 


41.42 




35.83 


36.81 


21.28 


CF3BR 


29.89 




27.55 


27.43 


10.95 


CFCL 


21.14 




15.64 


19.29 


9.99 


CFI 


49.29 




45.04 


46.23 


30.25 


CROX 


36.65 




33.11 


31.06 


15.71 


CUPZ 


47.07 


34.79 


34.49 


41.21 


21.90 


LAMO 


35.59 




34.99 


35.11 


25.93 


LASI 


37.81 




37.07 


35.68 


12.99 


LEV 


58.60 




51.70 


55.70 


22.38 


MES 


46.39 




44.08 


42.09 


25.65 


METYL 


28.98 




27.13 


27.31 


22.74 


NBPO 


38.95 


32.73 


24.36 


29.85 


8.27 


NIZR 


42.17 


41.65 


37.07 


36.06 


21.25 


PBS 


40.82 




38.10 


38.32 


26.50 


SAPO 


45.33 




41.09 


41.77 


17.37 


SBPO 


48.56 


30.85 


28.88 


31.69 


13.17 


SULPH 


32.53 




27.42 


30.12 


19.31 


YONO 


31.95 




27.42 


25.54 


16.75 



extraction run (7?d) and to the use of pseudo-symmetry 
information (^pseud). Patterson information (Rpajt) and 
probabilistic estimate (^prob) are given. The last col- 
umn corresponds to the use of the true intensities to 
start the Le Bail algorithm. The results in that table 
show that the use of prior information decreases the 7?f 
values respect to the traditional Le Bail extraction case, 
making them closer to the values in the last column. 
The pseudo-symmetry information can be applied if it 
is revealed. In Table 5, the R^ values obtained by using 
the fragment information (7?frag) with the traditional Le 
Bail extraction 7?^ value and the selected fragment in 
the asymmetric unit (in parentheses the corresponding 
percentage) are given for some test structures, confirm- 
ing the advantage in exploiting prior information. 
Therefore, the use of prior information can help when 
the obtained traditional Le Bail extraction solution is 
not reliable. The following suggestions can be taken in 
consideration for optimise its use and for avoiding the 
bad combined use of prior information because of their 
correlation: 

1) if pseudo-symmetry is revealed, and especially 
when the detected percentage is large, it is conven- 
ient to use it; 

2) if no pseudo-symmetry effect is detected, but the 
structure contains heavy atoms, the use of Patterson 
information can improve the results; 



129 



Volume 1 09, Number 1 , January-February 2004 

Journal of Research of the National Institute of Standards and Technology 



Table 5. For some test structures: the selected fragment and the cor- 
responding percentage, the traditional Le Bail extraction 7?p value 
(7?j)) (xlOO) and the R^ (xlOO) value when the fragment prior infor- 
mation is used (^frag) ^^^ given 



Code 



Selected fragment (%) 



^n 



^, 



FRAG 



AGPZ 1 Ag (97.2 %) 51.12 29.47 

BAMO 2 Ba (37.8 %) 42.35 38.06 

CUPZ 1 Cu (92.3 %) 46.98 26.53 

DADA 1 Ti 2 K (54.4 %) 33.65 31.92 

LAMO 1 La 2 Mo (55.8 %) 35.14 32.75 

LASI 2 La (34.1 %) 37.68 33.55 

NBPO 3 Nb (82.7 %) 40.19 23.99 

NIZR 2Zr(68.6%) 41.85 36.09 

SBPO 2 Sb (87.9 %) 49.52 23.25 

YONO 4 Y (95.8%) 31.93 20.15 



3) the probabilistic estimate information can be used in 
all the cases; 

4) if a fragment is located it can be exploited. 

The structure solutions supplied by EXPO are shown 
in Table 6, where for each test structure we have: the 
maximum (sinO/Xf value, the number of reflections, 
the corresponding number of independent observations 
(see [15] for details), the number of atoms to find in the 
asymmetric unit and the number of atoms found by 
EXPO (in a traditional Le Bail extraction or intensity- 
recycled run). Most of the structures are completely 
solved. This doesn't occur when the data quality is poor 
(small (sinO/Xf value and/or a large overlapping 
degree). 



7. The Random Approach 

When no prior information is available, or when it is 
poor, a recently developed procedure can be attempted 
[16]. It is based on a random approach and it works so 
that, for each cluster of overlapping reflections, some 
random partitions of the cluster overall intensity are 
considered. The partition corresponding to the best fit 
(the lowest Rp value) in the cluster local range is select- 
ed as the most reliable one and it provides the integrat- 
ed intensity values to use as starting ones in the Le Bail 
formula. The random procedure is applied before each 
Le Bail cycle. The merit of the new approach is to 
break the Le Bail tendency to equipartition the intensi- 
ty of a group of overlapping reflections. Its aim is to 
modify the equipartitioned intensities: a necessary goal, 
if the modified intensities correspond to the pivotal 
reflections in the phasing process. The results of the 
random procedure are shown in Table 7, where the 
phase error in the traditional Le Bail extraction case 



Table 6. For each test structure: the maximum (sinO/X) , the num- 
ber of reflections, the number of independent observations, the num- 
ber of atoms to locate (NATSl) and the number of atoms correctly 
located by EXPO in the asymmetric unit (NATS2) are given 



Code 



(sinOay 



M 



Mu 



ind 



NATSl 



NATS2 



AGPZ 

BACO 

BAMO 

BENZ 

CF3BR 

CFCL 

CFI 

CIME 

CROX 

CUPZ 

DADA 

LAMO 

LASI 

LEV 

MES 

METYL 

NBPO 

NIZR 

PBS 

SAPO 

SBPO 

SULPH 

YONO 



0.17 
0.26 
0.32 
0.30 
0.25 
0.37 
0.37 
0.19 
0.21 
0.17 
0.23 
0.13 
0.13 
0.19 
0.20 
0.27 
0.25 
0.18 
0.27 
0.17 
0.28 
0.26 
0.27 



258 
272 

1220 
716 
375 
203 
429 
924 
657 
243 
518 
271 
253 
323 
719 
318 

1201 
628 
477 
717 

1071 
220 
680 



72 
127 
396 
258 
141 
106 
149 
484 
202 

72 
197 
126 
105 
103 
229 
169 
481 
239 
179 
183 
337 

93 
203 



6 
6 
28 
9 
3 
3 
3 

17 

15 

6 

16 

14 

11 

17 

13 

5 

22 

18 

6 

21 

17 

3 

18 



4 
completed 

27 
completed 
completed 
completed 
completed 
completed 
completed 

5 

completed 

12 



11 

completed 

completed 

11 

5 

9 

13 

completed 

completed 



Table 7. For some test structures the phase error (the difference 
between the direct methods phases and the true phases) correspon- 
ding to the EXPO traditional Le Bail extraction run (ERRl) and to 
the use of the random procedure (ERR2) are given 



Code 



ERRl (°) 



ERR2 (°) 



ANDl 

DADA 

GAPO 

LEV 

UTMl 

YURI 



29.87 
48.33 
37.18 
73.55 
66.80 
28.24 



20.68 

23.32 
29.90 
32.53 
23.02 
22.50 



(ERRl) and in the random case (ERR2) are given for 
some test structures. The values corresponding to 
ERR2 are always much better than ERRl. This means 
that the power of the new procedure, to modify a small 
number of reflections that are very important in the 
phasing process, remarkably improves the phasing 
process, even though, on average, no more accurate 
structure factor moduli estimates are obtained. 
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8. The POLPO Procedure 

The solution provided by direct methods is frequent- 
ly incomplete. In particular, this happens in the case of 
heavy atom structure when the heavy atoms are easily 
located, but the light atoms are hardly recognised. The 
traditional approach for completing a partial solution 
consists of combining Fourier map calculations with 
Rietveld refinement. The trend is not trivial, not auto- 
matic, not fast. The new POLPO procedure [17] has 
been introduced in EXPO for completing the structure 
when the structure cations are located. The procedure 
uses the polyhedral information and it is based on the 
Monte Carlo technique. The starting point is the cation 
positions supplied by direct methods. The user gives 
the polyhedral information by using directives about 
the polyhedron type, the corresponding cation label, the 
expected polyhedral average distance, the distance tol- 
erance and the angle tolerance. The procedure automat- 
ically calculates the cation connectivity [17]. Several 
configurations obeying the requested polyhedral and 
connectivity rules are built. The geometrical construc- 
tion takes into account the tolerance about the distances 
and angles. Some configurations are rejected because 
they are chemically inconsistent. Among the remaining 
possible configurations, the model corresponding to the 
best fit between the observed and the calculated profile 
(the lowest Rp value) is selected. Table 8 shows the 
POLPO results: the number of feasible obtained solu- 
tions, the lowest Rp value corresponding to the chosen 

Table 8. For each test structure the number of feasible solutions 
(NFS), the best R^ value, the number of located anions (NA) respect 
to the true number (in parentheses), the average distance from the 
published atomic positions <^ and the CPU time are given 



Code 


NFS 


^P 


NA 


<d>ik) 


CPU time^ (s) 


CROX 


5 


0.22 


11(11) 


0.24 


33 


EMT 


2 


0.12 


12 (12) 


0.23 


103 


MCM 


3 


0.23 


15(13) 


0.23 


257 


NIZR 


2 


0.22 


13 (12) 


0.23 


34 


SAPO 


6 


0.29 


10(10) 


0.27 


38 


SGT 


6 


0.49 


11(7) 


0.45 


98 


UTMl 


9 


0.23 


15(13) 


0.34 


94 


VFI 


8 


0.25 


14 (14) 


0.37 


81 


VNI 


4 


0.19 


35 (30) 


0.24 


181 


^ Compaq^ personal Workstation 500au. 







Certain commercial equipment, instruments, or materials are iden- 
tified in this paper to foster understanding. Such identification does 
not imply recommendation or endorsement by the National Institute 
of Standards and Technology, nor does it imply that the materials or 
equipment identified are necessarily the best available for the pur- 
pose. 



model, the number of anions located in the asymmetric 
unit (in parentheses the true number), the average dis- 
tances between the POLPO positions and the true ones 
and the CPU time are given. It can be seen that all the 
structures are completed in few time. The discrepancy 
with regard to the number of anions depends on the 
imperfectly located positions of the starting cations and 
on the fact that the construction by POLPO is carried 
out in a geometrically perfect way. The POLPO proce- 
dure is currently being enhanced with the aim of com- 
pleting a structure when only some cations are posi- 
tioned. 



9. Conclusions 

Thank to its graphical interface, EXPO is a very 
user-friendly program. It is able to give different oppor- 
tunities for overcoming the difficulties in solving ab- 
initio crystal structures by powder diffi*action data. The 
next version of EXPO will include N-TREOR [18], a 
modified and updated version of the program for index- 
ing TREOR90 [19], the POLPO procedure and new 
strategies for optimising the Fourier map. 
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